Commentary on Steinley and Brusco (2011): recommendations and cautions.

نویسنده

  • Geoffrey J McLachlan
چکیده

I discuss the recommendations and cautions in Steinley and Brusco's (2011) article on the use of finite models to cluster a data set. In their article, much use is made of comparison with the K-means procedure. As noted by researchers for over 30 years, the K-means procedure can be viewed as a special case of finite mixture modeling in which the components are in equal (fixed) proportions and are taken to be normal with a common spherical covariance matrix. In this commentary, I pay particular attention to this link and to the use of normal mixture models with arbitrary component-covariance matrices.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

K-means may perform as well as mixture model clustering but may also be much worse: comment on Steinley and Brusco (2011).

Steinley and Brusco (2011) presented the results of a huge simulation study aimed at evaluating cluster recovery of mixture model clustering (MMC) both for the situation where the number of clusters is known and is unknown. They derived rather strong conclusions on the basis of this study, especially with regard to the good performance of K-means (KM) compared with MMC. I agree with the authors...

متن کامل

Choosing the number of clusters in Κ-means clustering.

Steinley (2007) provided a lower bound for the sum-of-squares error criterion function used in K-means clustering. In this article, on the basis of the lower bound, the authors propose a method to distinguish between 1 cluster (i.e., a single distribution) versus more than 1 cluster. Additionally, conditional on indicating there are multiple clusters, the procedure is extended to determine the ...

متن کامل

Clusterwise p* models for social network analysis

Clusterwise p∗ models are developed to detect differentially functioning network models as a function of the subset of observations being considered. These models allow the identification of subgroups (i.e., clusters) of individuals who are ‘structurally’ different from each other. These clusters are different from those produced by standard blockmodeling of social interactions in that the goal...

متن کامل

The p-median model as a tool for clustering psychological data.

The p-median clustering model represents a combinatorial approach to partition data sets into disjoint, nonhierarchical groups. Object classes are constructed around exemplars, that is, manifest objects in the data set, with the remaining instances assigned to their closest cluster centers. Effective, state-of-the-art implementations of p-median clustering are virtually unavailable in the popul...

متن کامل

A note on 'Exact and approximate methods for a one-dimensional minimax bin-packing problem' [Annals of Operations Research (2013) 206: 611-626]

In a recent paper, Brusco, Khn and Steinley [Ann. Oper. Res. 206:611-626 (2013)] conjecture that the 2 bins special case of the one-dimensional minimax bin-packing problem with bin size constraintsmight be solvable in polynomial time. In this note, we show that this problem is NP-hard for the special caseand thatit is strongly NP-hard for the general problem. We also propose a pseudo-polynomial...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Psychological methods

دوره 16 1  شماره 

صفحات  -

تاریخ انتشار 2011